Iterative procedure for computing accessible information in quantum communication 
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We present an iterative algorithm that finds the optimal measurement for extracting the accessible 
information in any quantum communication scenario. The maximization is achieved by a steepest- 
ascent approach toward the extremal point, following the gradient uphill in sufficiently small steps. 
We apply it to a simple ad-hoc example, as well as to a problem with a bearing on the security of 
a tomographic protocol for quantum key distribution. 



When studying problems in quantum information sci- 
ence, such as investigating strategies for eavesdropping 
on quantum communication lines or calculating the ca- 
pacity of a quantum channel, one often needs to search 
for the quantum measurement that is optimal for the pur- 
pose at hand. Typically, the quantity to be maximized is 
the accessible information associated with a set of states. 
We describe here a numerical procedure for finding this 
optimum, and illustrate it with two examples. 

We consider the following quantum communication 
scenario. Alice sends quantum objects to Bob in one of 
J states, specified by subnormalized statistical operators 
Pj (j = 1, . . . , J) whose traces tr {pj} are the probabili- 
ties which which they are sent. The state in which Bob 
receives the objects is therefore given by 

J 

P = ^Pi with tr M = 1 • (!) 

3=1 

He performs a positive-operator valued measurement 
(POVM), composed of K positive operators life that de- 
compose the identity, 

K 

n fc = i with n fe > o , (2) 

fe=i 

which he wishes to choose such that the joint probabilities 
Pjk = tr {pjU k } , J2 p i k = 1 ' ( 3 ) 

are most informative about the pj that happens to be the 
case for the particular quantum object under considera- 
tion. Bob's figure of merit is the accessible information 

where pj. and p.k are the marginal probabilities, 

pj- = J2 p J k = tr &j} ' pk = J2 p j k = tr { pIl j} ■ 

k j 

(5) 

Here, pj. is the probability that Alice sends the jth state, 
p.k is the probability that Bob gets the fcth measurement 



outcome, and pjk is the probability that she sends the 
jth state and he gets the kth outcome. 

Since Alice's p/s are given, the accessible informa- 
tion of Eq. Q is a nonlinear convex functional of Bob's 
POVM, H = {III, ...,11*:} -> /(H). The convexity, 

i(n^) < (i - A)i(nW) + A/(n( 2 ') (6) 

for U ( k X) = (1 - A)n[ 1) + An^ 2) with < A < 1, follows 
from 

(lr)V A) ) = E { ' k r~ rk3k) >o, (7) 

\d\J K ' £f 2p jkPj , k p. k w 

where p jk = (1 - X)pf k ) + Xpf k \ 

Now, since I(Jl) is convex, it acquires its global max- 
imum at the boundary of the convex set of all POVMs. 
The challenge is then to find the maximizing POVM, and 
it is the objective of this article to describe an iteration 
procedure for an efficient numerical search. 

We observe that 7(11) = J2k tr { R k^k} with 

and the response of 7(n) to a variation of the POVM 
is given by SI = J^k t r {RkSHk} because there is no net 
contribution from the induced changes of the RkS. The 
variations 5Hk are subject to the constraints of J2Jl, which 
we enforce by first writing Ilk = A\A k and then imposing 

Y,6Ii k = ^2(AiSA k + SA{A k ) = (9) 

k k 

on the variations SA k . The most general form for these 
variations is 

8A k =%Y^ tkiAi with e f kl = e lk , (10) 
I 

where the e fci 's are arbitrary infinitesimal operators. So, 
5I = iJ2^{e kl A l {R k -R l )Al} , (11) 

k,l 



2 



and the POVMs at the stationary points of /(II) must 
necessarily be such that A x R k A\ = A l R l A k or 



UiR k U k = UiRiU k for all k, 



(12) 



Upon summing over k or I we arrive at an equivalent 
set of equations, 

ILA = WiRi , R k n k = AU k , (13) 

which are adjoint statements of each other because 



(14) 



is hermitian. Mathematically speaking, A is the La- 
grange multiplier of the constraint and its physical 
significance is revealed by noting that I = tr {A}. 

The numerical procedure for finding the solution of 
these equations is an iteration method that realizes a 
steepest ascend toward the maximum of /(II). In each 
round we proceed in the direction of the gradient by 
putting e k i — —iaA k (R k — Ri)A\ in ljTU|) whereby a > 
controls the step size. As an immediate consequence of 
Eq. (|lip. the first-order (in a) change of /(II) is then 
assuredly positive, 



dl_ 

da 



a=0 



= J2^i( R k-Ri)^k(R k -Ri)^i}>0, (15) 

k,l 



and vanishes only at the stationary points where Eq. 112|) 
holds. But we must correct for the second-order terms 
that give a nonzero value to the sum in @. 

A round of the iteration procedure thus consists of the 
following three steps. 

Step 1: Use the existing approximate POVM 

to calculate pj k , p. k , and R k in accordance 

with Eqs. J3J), ©, and ©. 

Step 2: Next, choose a "small" positive value 

for a in G k = 1 + a(R k — J2i Ri^li) an d com- (iq\ 

pute n fe =G[n k G k . 

Step 3: Finally, sum up these H k , S — 11;, 
and take S^^tikS^ 1 / 2 as the new, improved 
approximation for life. 

We evaluate /(II) at the end of each round to verify that 
an acceptable value for a was chosen in Step 2. A de- 
crease of /, rather than an increase, would indicate an 
overshooting and thus tell us that a was too large pj. 

The reciprocal square root of S that is needed in Step 3 
can be computed easily and efficiently by a fixpoint iter- 
ation, such as the one specified by 



X 



n+l 



-x,„ 



1 



X n (SX n + X n S)X ri 



(17) 



Starting with X = 1, the sequence of X n converges 
rapidly to S^ 1 / 2 , provided that all eigenvalues of S are 



less than 3. This condition is surely met if a is small 
enough because S — 1 is positive and proportional to a 2 . 

Any randomly chosen POVM can be used as the start- 
ing point for the iteration Ijl6|) , except for the maximally 
ignorant POVMs for which each member is simply a mul- 
tiple of the identity, life = p. k . Since pj k — Pj.p-k for 
these POVMs, they result in /(II) = and thus mark 
the global minima of the accessible information. These 
minima are unstable fixpoints of the iteration (|16|l , and 
so it is enough to perturb them slightly by admixing a 
small fraction of a randomly generated POVM. 

Prior to any iteration, a choice must be made for the 
value of K, that is the number of elements in the POVM. 
According to a theorem by Davies 0, one never needs 
more than K — r 2 members, where r is the rank of p. 
But very often fewer elements will do. For example, if it 
is possible to represent all pj by real matrices, then K = 
ir(r+l) members suffice, as Sasaki et al. have shown 
A particular example that has a bearing on the security 
analysis of certain schemes for quantum cryptography 
are rank-1 p^-'s that form a so-called "acute pyramid", 
for which one knows that K = J or K = J + 1 will do, 
depending on the volume of the pyramid Q. 



One approach is, therefore, to choose K 



for 



general /jj's and K = \r{r + 1) if the pj's have joint 
real matrix representations. After the iterative opti- 
mization, one would then look for equivalent members 
and combine them into one new member, thereby reduc- 
ing the value of K. Members IT^ and llfc 2 are equiv- 
alent if Pjk 1 Pj'k 2 — Pj'kiPjk 2 f° r a U j an d j', for then 
Rk! = Rk 2 : an d the pair (ll kl + U k2 , 0) is as good as the 
pair (n fcl ,IT fc2 ). 

Another approach is to begin with a small K value, 
with K — J suggesting itself. Then, after optimizing for 
this K, one would add a randomly chosen II^ + i and so 
increase K by 1, with the proper normalization to unit 
sum achieved analogously to Step 3 of (jpp|l . The optimal 
POVM has been found when the increase of K becomes 
virtual, i.e., when the eventual reduction of equivalent 
members decreases K. 

We note that the iteration scheme (|16f> can be used for 
the optimization of other functionals as well, as long as 
there are given pj's and a procedure for calculating the 
R k 's from the Pj's and II^'s. Indeed, the set of R k s is 
the functional gradient of /(II) and can be regarded as 
defining /(II) up to a II-independent constant. 

An example is Helstrom's classic problem of minimum- 
error discrimination , where one has K — J and wishes 
to maximize Y], pjj. Therefore, we have R k = p k for Hel- 
strom's problem and can iterate as described above with- 
out further ado. In view of the very different Z?fe's, the 
optimal POVM for minimum-error discrimination will, 
as a rule, be different from the POVM that maximizes 
the accessible information of Eq. (0}. Especially, when 
the states to be discriminated are pure states that are 
nearly collinear, tr {pjPj>} ^ Pj.pj'., p is almost a pure 
state itself, and this tends to bias the optimal POVM for 
accessible information away from the one that minimizes 
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TABLE I: Hellstrom's Success Rate (SR) J2 Pn for the ex- 
ample of 1)18^ . as obtained for a particular iteration sequence 
that begins with a randomly chosen Oth POVM, given for 
various Numbers of Iteration Rounds (NIR). The right col- 
umn lists the Accessible Information (AI) obtained for the 
successive POVMs that optimize SR iteratively. 



NIR 




10 
20 
30 
10 
50 
oo 



SR 



0.5 

0.838 445 2747 
0.840 825 0481 
0.840 886 8797 
0.840 888 4135 
0.840 888 4515 
0.840 888 4524 



AI 



0.0 

0.434 433 2664 
0.447 716 5615 
0.448 0814579 
0.448 090 5243 
0.448 090 7489 
0.448 090 7546 



are to be distinguished by the eavesdropper constitute a 
highly symmetric family of quantum states. The optimal 
POVM then also tends to be relatively simple, which 
makes it sometimes possible to reconstruct its analytical 
form from the result of the numerical search. 

As an illustration, we consider the qubit version of the 
fully tomographic protocols for quantum key distribution 
of Ref . |(| , an extension of the classic 4-state protocol of 
Bennett and Brassard 0, or rather its 6-state general- 
ization ||, with some features of Ekert's protocol 
which itself involves partial tomography. In this proto- 
col, eavesdropper Eve controls the source that distributes 
qubit pairs to the Alice and Bob, one pair at a time, in 
such a way that the statistical operator of these pairs is 
the singlet state |i/>) with an admixture of unbiased noise, 



the discrimination error. This is well illustrated by the 
acute pyramids mentioned above 

As a simple example let us consider J = 2 and pi, pi 
with the following matrix representations, 



Pr- 




P2-- 




(18) 



These are rank-2 matrices with r = 3 for the rank of p = 
Pi + pi- The Helstrom problem is solved by ni and n2 
projecting on the subspaces associated with the positive 
and negative eigenvalues of p\ — P2, respectively, so that 
ma,xJ2jPn = 0.840 888 4524. As shown in Tabled the 
iteration converges to this value in a few dozen rounds. 

The respective iterations for maximizing the accessible 
information are summarized in Fig.^for K = 2 and K = 
3. No improvement is found for K — 4,5, or 6, so that the 
optimal POVM has three members in this example. No 
attempt has been made, for Table[I]or Fig.^ to optimize 
parameter a of (|16|) . A judicious choice could reduce the 
necessary number of iteration rounds by much. 

The algorithm of l|16|) provides numerical answers 
for the accessible information and identifies the opti- 
mal POVMs. This is already sufficient for many quan- 
tum information tasks, such as the checking of the secu- 
rity of noisy quantum communication, or the calculation 
of the maximal information yield per quantum system 
sent. The applications are not limited to these, how- 
ever. Sometimes it is possible, upon restricting the set 
of measurements over which the accessible information 
is maximized to some special class of measurements, to 
derive analytical forms for the optimal POVM. But its 
global optimality can often be only conjectured, without 
having a solid proof. Our algorithm can be used, in such 
cases, to check the optimality of the POVM found. Such 
checks were indeed performed for the low-dimensional 
"pyramids" of Ref. 0. 

Another, even more important potential application is 
the use of the algorithm for finding the analytical solu- 
tions to optimization problems. For instance, it is typical 
in quantum communication protocols that the states that 



Pa&b = - e )(V>| 



e 
4 



(19) 



with < e < 1. 

Both Alice and Bob measure the six-element POVM 
consisting of the rank-1 operators i(l ± a^) with ( = 
x,y, z, where the cr^'s are the respective basic Pauli op- 
erators for their qubits. The mutual information of the 
resulting joint probabilities is 



A&B 



elog 2 e+(2-e) log 2 (2-e) 



(20) 



where we now adopt the conventions of information the- 
ory and employ the binary logarithm rather than the 
natural logarithm that is more convenient in Eqs. — 
(El). 
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FIG. 1: Values of the accessible information for the example 
of Eq. 1181 . as a function of the number of iteration rounds. 
Thin dashed horizontal line a shows the value obtained for 
the POVM that optimizes Hellstrom's success rate, see Ta- 
ble □ Thick dashed line b is for K = 2, with the POVM of 
line a as the Oth approximation. The optimal POVM is com- 
posed of a rank-1 projector and a rank-2 projector. Curves 
c and c' are for K = 3 with different random choices for the 
Oth approximation. The optimal POVM is a von Neumann 
measurement, i.e., it is composed of three rank-1 projectors. 
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At the source, each pair is entangled with an ancilla, 
which remains in Eve's possession as a quantum record 
of what has been sent to Alice and Bob. As shown in 
Ref. (see also the appendix in Q), the best Eve can 
do is to use another qubit pair for the ancilla and prepare 
the pure state 



|*) = 1^12^34)0+ 1^13^24)6; 



(21) 



where qubits 1 and 2 are Alice's and Bob's, respec- 
tively, qubits 3 and 4 make up the ancilla, and ipjk 
means the singlet for qubits j and k. The constraint 
tr 3 4 { I *)(*|} = /Oa&b requires 



2a + 6 



3e, 



(22) 



and it is convenient to choose the arbitrary complex 
phases of a and b such that e = (2a + b)b* = v4e — 3e 2 
is positive. 

For each of Alice's (or Bob's) six measurement results 
there is a corresponding ancilla state, 



PC± = 



1 

24 



(1- 



? (3).^(4) 



(23) 

with £ = x,y, z. For Eve it is, therefore, as if Alice 
were sending her these subnormalized statistical opera- 
tors, and what Eve can know about Alice's measurement 
results is measured by the accessible information associ- 
ated with this sextet. 

Eve's optimal POVM can be found numerically by the 
iteration procedure of l|l(jfl . One finds that it has six 
elements which, upon careful inspection of the numerical 
results, arc identified as the rank-1 operators 



n, 



it 



n/3, 



7 (4 ); 



(3) 
C 1 



.(4) 
C 



1 



!(3) . ^(4) 



with £ = x,y,z. Note the remarkable simplicity of the 
optimal POVM: it has relatively few elements, and does 
not depend on the noise parameter e. This is clearly a 
consequence of the symmetry of the sextet <|23|) . which 
consists of unitarily equivalent rank-2 operators. 



The accessible information gained by Eve from this 
POVM is given by 



Ia&e = Ia&b (e — > 1 



(25) 



with Ja&b( c ) as m Eq. (|2C)|> . Accordingly, the critical e 
value, for which Ia&e = Xa&b, is given by 



^crit 



(I 



V3 



0.2363. 



(26) 



For e < e cr ;t, we have Ia&b > Xa.&e, so that the informa- 
tion that Bob has about Alice's bit values exceeds Eve's 
information about them. Then, the Csiszar-Korner the- 
orem [lOj ensures that they can generate a private, secure 
cryptographic key by one-way communication. 

We must not fail to mention that the POVM of Eq. 
is only optimal for e < |, that is in the interesting pa- 
rameter range where Pa&b is not separable and the joint 
probabilities between Alice and Bob contain nonclassi- 
cal correlations. For e > |, Eve can blend /?a&b from 
product states and thereby obtain /a&e = \ right away. 

In conclusion, we have presented an iterative numeri- 
cal procedure for finding the POVM that optimally ex- 
tracts the accessible information from a given set of states 
received in a quantum communication scenario. The 
method is a steepest-ascend approach toward the maxi- 
mum; it follows the gradient in steps that are sufnently 
small to avoid over-shooting, so that the accessible in- 
formation increases monotonically in each iteration step. 
We have illustrated the method at a simple ad-hoc exam- 
ple. A second example, which has a bearing on quantum 
key distribution, shows how the analytical answer can be 
established, once crucial insight is gained from the nu- 
merical solution. 
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